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CLAIMS 

1 . A iterthod of preparing a mechanijcally searchable index 
with respect to a vector database in which a finite ntimber of sets 
5 each including at least N-dimensional real vector and an 

identification nvnnber of the vector are registered as vector data, 
said method comprising: 

a first step of vector index preparation of dividing N 
coitpDnents into m sets in a predetermined method with respect to 

10 the N-dimensional real vector V of each vector data in said vector 
database, pr^>aring m partial vectors v^ to v^, subsequently 
tabulating a distribution of a norm of the partial vector Vj, (k = 1 
to m) , pr^)aring a norm division table in v^ich a norm range of a 
predetermined D type norm division is determined, calculating a 

15 region nunter d to T^iiich said partial vector v^. belongs in 

accordance with predetermined D region center vectors p^ to pt,, 
tabulating a distribution of a cosine (Vj,*Pd)/( |Vh,|*|pd| ) of an angle 
formed by said partial vector v^, and the region center vector p^ as 
a declination distribution, and preparing a declination division 

20 table in wtiich a declination range of the predetermined C type 
declination division is recorded; 

a second step of the vector index preparation of dividing 
N conponents into m sets in the same method as said first step with 
respect to the N-dimensional real vector V of each vector data in 

25 said vector database, preparing m partial vectors v^ to v^^^ 
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referring to said norm divisir>n table to calculate a mmiber r of 
the nom division to which the norm of said partial vector Vb 
belongs with respect to the partial vector v^ (b = 1 to m) for the 
partial space number b, calculating the region number d to which 
5 said partial vector v^ belongs in accordance with the predetermined 
D region center vectors p^ to pt, in the same method as said first 
step, calculating a declination (Vb»Pd)/( |vi,|* |pa| ) as a cosine of an 
angle formed by said partial vector v^, and the region center vector 
Pd radicating a center direction of the region of said region 

10 number d, referring to said declination division table^ calculating 
a number c of the belonging declination division, and calculating 
index registration data to be registered in a vector index from 
said partial space number b, said region number d, said declination 
division number c, said norm division number r, the ccnpDnent of 

15 said partial vector v^, and the identification number i; and 
a third step of the vector index preparation of 
constituting the vector index such that the identification number 
and the ccn^nent of each partial vector can be searched using a 
set of the partial space nvmiber b, the region number d, the 

20 declination division number c and a norm division number range [r^, 
r2] as a key from said norm division table, said declination 
division table, and said index registration data, and such that the 
vector ccirponent of each vector data can be searched with the 
identification number of the vector conqponent- 



25 
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2 . A method of prqparing a mechanically searchable index 
with respect to a vector database in M*^ich a finite number of sets 
each including at least N-dimensional real vector and an 
identification nuinber of the vector are registered as vector data, 
5 said method ccB[5)rising: 

a first step of vector index preparation of dividing N 
ccaiponents into m sets in a predetermined method with respect to 
the N-dimensional real vector V of each vector data in said vector 
database, preparing m partial vectors v^ to v^, subsequently 

10 tabulating a distribution of a norm of the partial vector Vi, (b = 1 
to m) for each partial space nuirber b, preparing a norm division 
table in vrtiich a norm range of a predetermined D type norm division 
is determined, calculating a region number d to \A^iich said partial 
vector Vb belongs in accordance with predetermined D region center 

15 vectors p^ to p^, tabulating a distribution of a cosine 

(Vb»pd)/{ |Vb|*|Pd| ) of an angle formed by said partial vector v^, and 
the region center vector p^ as a declination distribution, and 
preparing a declination division table in TArihich a declination range 
of the predetermined C type declination division is recorded; 

20 a second step of the vector index preparation of dividing 

N ccxtponents into m sets in the same method as said first step with 
respect to the N-dimensional real vector V of each vector data in 
said vector database, preparing m partial vectors v^ to v^, 
referring to said norm division table to calculate a number r of 

25 the norm division to which the norm of said partial vector Vj, 
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belongs with respect to the partial vector (b = 1 to m) for said 
partial space b, calculating the region nunoiber d to which said 
partial vector v^ belongs in accordance with the predetermined D 
region center vectors Pi to pt in the same method as said first step, 
5 calculating a declination (Vt,*Pd)/( |vi,|*|p^| ) as a cosine of an angle 
formed by said partial vector v^, and the region center vector p^j 
indicating a center direction of the region of said region nuniber d, 
referring to said declination divisijon table, calculating a number 
c of the belonging declination division, calculating a component 

10 division number of a predetermined range to wiiich v^^ belongs 

from a maximum value of the norm of the norm division corresponding 
to said calculated norm division number r with respect to each 
con^nent v^j of said calculated partial vector v^, and calculating 
index registration data to be registered in a vector index from 

15 said partial space number b, said region number d, said declination 
division number c, said norm division number r, a string of said 
coicponent division numbers Wj, and the identification number i; and 

a third step of the vector index preparation of 
constituting the vector index such that the identification number 

20 and the cat^nent of each partial vector can be searched using a 
set of the partial space number b, the region number d, the 
declination division number c and a norm division number range [ri, 
rj] as a key from said norm division table, said declination 
division table, and said index registration data, and such that the 

25 vector coirponent of each vector data can be searched with the 
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identification niiraber of the vector con^nent* 

3. The vector index preparing method according to claim 
1 or 2 v^erein in the fixst and second st^s of said vector index 
5 preparation^ an angle cosine {^pd)/( |\^| *|pd| ) is used as a 

function of an angle formed by the partial vector vb and the region 
center vector pd, and a value of the function is used as a 
declination to obtain the declination distribution. 

10 4* The vector index preparing method according to claim 

1 or 2 v^erein in the first and second steps of said vector index 
preparation, N/m conponents or (N/m)+l ccirponents are extracted in 
order from a top component of V so that all components of an N- 
dimensional vector V are extracted, and the partial vector is 

15 prepared. 

5. The vector index pi^>aring method according to claim 
1 \n^erein in the first step of said vector index preparation, 
during preparation of the norm division table, the norm division is 

20 determined based on the tabulation result of the norm distribution 
so that the number of partial vectors belonging to the norm range 
corresponding to each norm division beccmes as uniform as possible* 

6. The vector index preparing method according to claim 
25 1 wherein in the first step of said vector index preparation. 
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during pr^>aration of the declination division table, the 
declination division is determined based on the tabulation result 
of the declination distribution so that the number of partial 
vectors belonging to the declination range corresponding to each 
5 declination division becomes as uniform as possible. 

7. The vector index preparing method according to claim 
1 or 2 wherein in the first and second steps of said vector index 
preparation, the region number of the partial vector Vi, is obtained 
10 as a number d of the region center vector p^ in \irfiich a cosine 

(Vb*Pd)/( |Vb|*|Pd| ) of an angle formed by and v^ is largest among 
the predetermined D region center vector Pi to p^- 

8* The vector index preparing method according to claim 
15 1 or 2 wherein in the third step of said vector index preparation, 
a search tree in ^Adiich a number (b*Nd*Wc*Nr)+(d*Nc*Nr)+(c*Nr)+r 
obtained by ccmbining the partial space number b, the region nurhber 
d, the declination division number c, and the norm division number 
r can be used as a key to search the identification number i and 
20 the coitpDnent of the vector, and a table in v^iich the vector data 
identification number is used as an affix and the key of said 
search tree of each partial vector is recorded are prepared and 
used as part of the vector index. 

25 9. The vector index preparing method according to claim 
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1 or 2 wherein in the second step of said vector index preparation, 
the vector otytained by normalizing all vectors <0, — , 0, +1) to 
(-1, -1) whose cottponent is any one of {-1, 0, +1} and \idiich 

are not 0 vector is used as the region center vector. 

5 

10, A similar vector searching method in v*djch a query 
vector Q of an N-diinensional real vector, an dinner product lower 
limit value cx, and maximum obtained vector number L are designated 
as search conditions, a vector index prepared from vector data with 

10 a finite rannber of sets of at least N-dimensional real vector and 

an ID number of the real vector registered therein is searched, and 
I, sets at maximum (i, V»Q) of an identification number i and an 
inner product of Q and V are obtained with respect to vector data 
(i, V) of said vector database ^^^ose value V*Q of the inner product 

15 with said query vector Q is larger than said inner product lower 
limit value a, said similar vector searching method ccxrprising: 

a first st^ of similar vector search of dividing N 
coirponents of Q into m sets in the same predetermined method as a 
method used in preparing said vector index with respect to said 

20 query vector Q, preparing m partial query vectors to q^, 

calculating a partial inner product lower limit value as a lower 
limit value of an inner product (hereinafter referred to as 
"partial iimer product) of each partial query vector q^, and the 
corresponding partial vector frcm a designated inner product lower 

25 limit value a, calculating a partial space number b, and a set (c. 
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[r^r rj]) of a declination division number c to be searched in a 
region nxmiber d and a norm division range [r^, rj] from a value of 
an inner product Pd*qb of the region center vector and said 
partial query vector qt,, said partial inner product lower lindt: 
5 value f^/ and a norm division table and a declination division 
table in said vector index with respect to each partial query 
vector q^ (b = 1 to m) and each region b, searching a range of said 
vector index using (b, d,- c, [r^, rj] ) as a search condition based 
on said calculated (c, [ri, rj]), obtaining the identification 

10 number i and the component of the partial vector v^ satisfying the 
condition as an index search result, calculating a partial inner 
product difference (Vt»<^)-fi, as a difference between a partial 
inner product v^^q^ of said Vj, and and said partial inner product 
lower limit value f^, and acciamulating (adding) the difference as 

15 an inner product difference upper limit value S[i] of the 

identification number i of an inner product difference table; and 
a second step of the similar vector search of searching 
said vector index with the identification number i in order from a 
largest value in said inner product difference table S[i] to obtain 

20 a vector data coirponent V, calculating an inner product difference 
value t = V»Q-a by subtracting a from the inner product V*Q of V 
and said query vector Q, and out^utting a set of at least the 
identification number i and an inner product t+a as a search 
result with respect to L pieces at maximum of vector data with a 

25 large inner product difference value vdien L or more pieces of 
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vector data having the ianer prcxiuct difference value larger than a 
maximuin value of an element having a non-calculated inner product 
difference value are collected, or v^en the inner products of all 
the vector data having a positive inner piDoduct difference upper 
5 limit value are calculated in said inner product difference table- 

11. A sinuJLar vector searching method in which a query 
vector Q of an N-dimensional real vector, a distance ujper limit 
value a, and maximum obtained vector number L are designated as 

10 search conditions, a vector index prepared from vector data with a 
finite number of sets of at least N-dimensional real vector and an 
identification number of the real vector registered therein is 
searched, and L sets at maximum (i, p) of an identification nvmter 
i of an N-dimsisional real vector V in said vector data and a 

15 distance p between Q and V are obtained such that a value of an 
inner product with said query vector Q is not more than said 
distance lopper limit value ct, said similar vector searching method 
conprising: 

a first step of siioilar vector search of dividing N 
20 con^nents of Q into m sets in the same predetermined method as a 
method used in preparing said vector index with respect to said 
que3:y vector Q, preparing m partial query vectors q^ to q„, 
calculating a partial square distance uj^jer limit value f^, as an 
upper limit value of a square distance |vt^-^p (i.e., square of 
25 Euclidean distance, hereinafter referred to as "partial square 
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dist:ance") of each partial query vector and the corresponding 
partial vector Vj, from a designated distance upper limit value cx, 
systetiatically generating a set (b, d, c, [r^, rj]) of a partial 
space nxunber b to be searched, a region number d, a declination 
5 division number c and a norm division range [ri, rj] from said 

partial query vector g^^ said partial square distance upper limit 
value fi,, and a norm division table and a declination division 
table in said vector index with respect to each partial query 
vector <^ (b = 1 to m), searching a range of said vector index 

10 vising said generated (b, d, c, [r^, ra]) as a search condition, 
obtaining the id^itif ication niimber i and the con^nent of the 
partial vector Vi, satisfying the condition as an index search 
result, calculating a partial square distance difference fb-lv^-^il^ 
as a difference between said partial square distance upper limit 

15 value f^ and a partial square distance |Vb-qb|^ of v^ and q^, and 
accumulating (adding) the difference as a square distance 
difference upper limit value S[i] of the identification number i of 
a square distance difference table; and 

a second step of the similar vector search of searching 

20 said vector index with the identification nuittoer i in order from a 
largest value in said square distance difference table S[i] to 
obtain a vector data coitiponent V, calculating a square distance 
difference value a^-|v-Q|^ by subtracting a square distance |v-Q|^ 
of V and said query vector Q from a squared distance upper limit 

25 value a% and outputting a set of at least the identification 
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numiber i and a distance (a^-t)^^ as a search result with respect to 
L pieces at inaxinaim of vector data with a large square distance 
difference value t vdien L or more pieces of vector data having the 
square distance difference value larger than a maxiraum value of an 
5 element having a non-calculated square distance difference value 

are collected, or when the squcore distance difference values of all 
the vector data having a positive square distance difference upper 
limit value are calculated in said square distance difference table. 

10 12. The similar vector searching method according to 

claim 10 or 11 v^erein in the first step of said similar vector 
search, N/m ccinponents or (N/m)+l ccarponents are extracted in order 
from a top ccannponent of v so that all ccmponents of an N- 
dimensional vector V are extracted, and the partial query vector is 

15 prepared. 

13 • HhB similar vector searching method according to 
claim 11 vrtierein in the first st^ of said similar vector search, 
the partial inner product lower limit value f^ as the lower limit 
20 value of the inner product of said partial query vector q, and the 
corresponding partial vector v^ is calculated from a designated 
inner product lower limit value a by f ^ = ot|(^|^/2{ jq^]^) • 



25 



14. The similar vector searching method according to 
claim 11 vy*ierein in the first step of said similar vector search. 
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the partial square distance upper limit value fj^, as the upper limit 
value of the square distance of said partial query vector q^^ and 
the corresponding partial vector v^, is calculated from a designated 
distance lower/upper liinit value a by f ^ = a^|qbr/2( l<i,|^)- 

5 

15. An apparatxjs for preparing a mechanically searchable 
index with respect to a vector database in v4iich a finite nuiriber of 
sets each including at least N-dimensional real vector and an 
identification number of the vector are registered as vector data, 

10 said apparatus ccirprising : 

partial vector calculation means for dividing N 
components into m sets in a predetermined inethod with respect to 
the N-dimensional real vector V of each vector data in said vector 
database, and preparing m partial vectors v^ to v^^; 

15 norm distribution tabulation means for tabulating a 

distribution of a norm of the partial vector v^ (k = 1 to m) among 
said pr^>ared m partial vectors v^ to v„, and preparing a norm 
division table in v^ch a norm range of a predetenained D type rK>rm 
division is determined; 

20 region nvnrber calculation means for calculating a region 

number d to which said partial vector Vj, belongs in accordance with 
predetermined D region center vectors pj to p^; 

declination distribution tabulation means for tabulating 
a distribution of a cosine (v,,»Pd)/( |Vk|*|pj| ) of an angle formed by 

25 said partial vector v^ and the region center vector p^ as a 
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declination distribution, and preparing a declination division 
table in which a declination range of the predetermined C type 
declination division is recorded; 

norm division number calculation means for referring to 
5 said norm division table to calculate a number r of the norm 

division to vdiich the norm of said partial vector Vj, belongs with 
respect to the partial vector Vj^ (b = 1 to m) for the partial space 
number b among the m partial vectors v^ to v^ prepared by said 
partial vector calculation means; 

10 declination division number calculation means for 

calculating a declination (Vb»Pd)/( |vb|*|pd| ) as a cosine of an angle 
formed by said partial vector Vj, and the region center vector p^ 
indicating a center direction of the region of said region number d 
calculated by said region number calculation means; 

15 index data calculation means for calculating index 

registration data to be registered in a vector index from said 
partial space number b, said region number d, said declination 
division number c, said norm division number r, the conponent of 
said partial vector Vj,, and the identification number i; and 

20 index constituting means for constituting the vector 

index such that the identification number and the component of each 
partial vector can be searched using a set of the partial space 
number b, the region number d, the declination division number c 
and a norm division number range [r^, rj as a key frcm said norm 

25 division table, said declination division table, and said index 
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registration data, and such that the vector component of each 
vector data can be searched with the identification niimber of the 
vector component. 



5 16. An apparatus for preparing a mechanically searchable 

index with respect to a vector database in wliich a finite nuiriber of 
sets each including at least N-dinaensional real vector and an 
identification ninnber of the vector are registered as vector data, 
said apparatus ccuprising: 

10 partial vector calculation means for dividing N 

components into m sets in a predetermined method with respect to 
the N-dimensional real vector V of each vector data in said vector 
database, and preparing m partial vectors v^ to v„; 

norm distribution tabulation means for tabxilating a 

15 distribution of a norm of the partial vector v^, (b — 1 to m) for a 
partial space nunfcer b among said prepared m partial vectors Vi to 
v^f and preparing a norm division table in which a norm range of a 
predetermined D type norm division is determined; 

region number calculation means for calculating a region 

20 number d to \K^ch said partial vector v^, belongs in accordance with 
predetermined D region center vectors p^ to pt>7 

declination distribution tabulation means for tabulating 
a distribxition of a cosine (Vb*Pd)/( | v^l *|Pd| ) of an angle formed by 
said partial vector v^ and the region center vector p^ as a 

25 declination distribution, and preparing a declination division 
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table in Tftdiich a declination range of the predetermined C type 
declination division is recorded; 

norm division niamber calculation means for referring to 
said norm division table to calculate a number r of the norm 
5 division to which the noim of said partial vector Vj, belongs with 

respect to the partial vector v^^ (b = 1 to m) for a partial space b 
among the m partial vectors v^ to v^^ prepared by said partial vector 
calculation means; 

declination division nuniber calculation means for 
10 calculating a declination (Vj,»Pd)/( |Vb| *|p^i| ) as a cosine of an angle 
formed by said partial vector v^ and the region center vector p^ 
indicating a center direction of the region of the region number d 
calculated by said region number calcvilation means; 

connponent division number calculation means for 
15 calculating a conponent division number w^ of a predetermined range 
to \rtAiich Vj^ belongs from a maximum value of the norm of the norm 
division corresponding to said calculated norm division nvimber r 
with respect to each carponent Vj^ of said calculated partial vector 
v^; 

20 index data calculation means for calculating index 

registration data to be registered in a vector index from said 
partial space number b^ said region number d, said declination 
division number c, said norm division number r, a string of said 
component division numbers Wj, and the identification number i; and 

25 index constituting means for constituting the vector 
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index such that the identif icaticn number and the catiponent of each 
partial vector can be searched using a set of the partial space 
number b, the region number d, the declination division number c 
and a norm division nuirber range [ri, rj] as a key from said norm 
5 division table, said declination division table, and said index 
registration data, and such that the vector coinponent of each 
vector data can be searched with the identification number of the 
vector component - 

10 17 ♦ The vector index preparing apparatus according to 

claim 15 or 16 herein said partial vector calculation means 
extracts N/m components or (N/m)+l components in order from a top 
component of V so that all conponents of an N-dimensional vector V 
are extracted, and prepares the partial vector. 

15 

18. The vector index preparing apparatus according to 
claim 15 v^erein during preparation of the norm division table said 
norm distribution tabulation means determines the norm division 
based on the tabulation result of the norm distribution so that the 
20 number of partial vectors belonging to the norm range corresponding 
to each norm division becomes as uniform as possible - 



25 



19. The vector index preparing apparatus according to 
claiai 15 v;herein during preparation of the declination division 
table, said declination distribution tabulation means determines 
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the declination division based on the tabulation result of the 
declination distribution so that the number of partial vectors 
belonging to the declination range corresponding to each 
declination division beoones as uniform as possible. 

5 

20. The vector index preparing apparatus according to 
claim 15 or 16 \fl^erein said region number calculation means obtains 
the region nuntoer of the partial vector Vj, as a number d of the 
region center vector p^ in v^ch a cosine (Vb»Pd)/( |v^,|*|p^| ) of an 

10 angle formed by p^ and v^ is largest among the predetermined d 
region center vector p^ to p^- 

21. The vector index preparing apparatus according to 
claim 15 or 16 wherein said index constituting means prepares a 

15 search tree in v^ch a number (b*Wd*l*::*Nr)+(d*Nc*Nr)+(c*Kr)+r 

obtained by combining the partial space number b, the region number 
d, the declination division number c, and the norm division number 
r can be used as a kaey to search the identification number i and 
the corrfonent of the vector, and a table in vjhich the vector data 

20 identification number is used as an affix and the key of said 

search tree of each partial vector is recorded, and uses the search 
tree and the table as a part of the vector index. 



22- The vector index preparing apparatus according to 
25 claim 15 or 16 v^erein said region number calculation means uses 
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the vector obtained by normalizing all vectors (0, , 0, -i-l) to 

(-1, — , -1) v^ose ooitponent is any one of <-l, 0, +1} and which 
are not 0 vector as the region center vector. 



5 23. A similar vector searching apparatus for designating 

a query vector Q of an N-dimensional real vector, an inner product 
lower limit value a, and maximum obtained vector number L as 
search conditions, searching a vector index prepared from vectx>r 
data with a finite number of sets of at least N-dimensional real 
10 vector and an ID number of the real vector registered therein, and 
obtaining L sets at maxirwom (i, V»Q) of an identification number i 
and an inner product of Q and V with respect to vector data (i, V) 
m of said vector database whose value V»Q of the inner product with 

said query vector Q is larger than said inner product lower limit 
15 value a, said similar vector searching apparatus CGir5)risijng: 

partial query condition calculation means for dividing N 
components of Q into m sets in the same predetermined method as a 
method used in preparing said vector index with respect to said 
query vector Q, preparing m partial query vectors q^ to q^, and 
20 calculating a partial inner product lower limit value f ^ as a lower 
limit value of an inner product (hereinafter referred to as 
"partial inner product) of each partial query vector qt, and the 
corresponding partial vector from a designated inner product lower 
limit value a; 

25 se£ux:h object range generation means for calculating a 
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partial space nimnber b, and a set (c, [r^, rj]) of a declination 
division nmnber c to be searched in a region nmriber d and a norm 
division range [r^, r^] from a value of an inner product p/qb of the 
region center vector p^ and said partial query vector said 
5 partial inner product lower limit value f^, and a nozm division 
table and a declination division table in said vector index with 
respect to each partial query vector q^ (b = 1 to m) and each 
region b; 

index search means for searching a range of said vector 
10 index using (b, d, c, [r^, rs]) as a search condition based on (c, 
[^1/ ^il) calculated by said search object range generation means, 
and obtaining the identification number i and the coinponent of the 
partial vector v^ satisfying the condition as an index search 
result; 

15 inner product difference upper limit calculation means 

for calculating a partial inner product difference (Vb»<^)-fi, as a 
difference between a partial inner product Vi,«(j, of said v^, and <^ 
and said partial inner product lower limit value f^r and 
accumulating (adding) the difference as an inner product difference 

20 upper limit value S[i] of the identification number i of an inner 
product difference table; and 

similarity search result determination means for 
searching said vector index with the identification number i in 
order from a largest value in said inner product difference table 

25 S[i] to obtain a vector data carponent V, calculating an inner 
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product difference value t = V«Q-a by subtracting a from the junner 
prcxiuct V'Q of V and said query vector Q, and outputting a set of 
at least the identification number i and an inner product t-kx as a 
search result with respect to L pieces at maximuni of vector data 
5 with a large inner product difference value viiien L or more pieces 
of vector data having the inner product difference value larger 
than a maxirauia value of an elenent having a non-calculated inner 
product difference value are collected, or when the inner products 
of all the vector data having a positive inner product difference 
10 upper limit value are calculated in said inner product difference 
table. 

24. A similar vector searching apparatus for designating 
a query vector Q of an N-dimensional real vector, a distance upper 

15 limit value a, and maximum obtained vector number L as search 

conditions, searching a vector index prepared from vector data with 
a finite number of sets of at least N-dimensional real vector and 
an identification number of the real vector registered therein, and 
obtaining L sets at maximum (i, p) of an identification number i of 

20 an N-dimensional real vector V in said vector data and a distance p 
between Q and V such that a value of an inner product with said 
query vector Q is not more than said distance u^^r limit value a, 
said similar vector searching apparatus ccanr^rising: 

partial query condition calculation means for dividing N 

25 ccxnponaats of Q into m sets in the same predetermined method as a 



method used in preparing said vector index with respect to said 
query vector Q, preparing m partial query vectors qi to c^, 
calculating a partial square distance upper limit value fb as an 
upper limit value of a square distance Iv^-c^l^ <i*e., square of 
Euclidean distance, hereinafter referred to as ''partial square 
distance'') of each partial query vector <j, and the corresponding 
partial vector Vj, frcm a designated distance upper limit value a; 

search object range generation means for systematically 
generating a set (b, d, c, [r^, ra]) of a partial space number b to 
be searched, a region nvimber d, a declination division number c and 
a norm division range [r^, r^] from said partial query vector q^,, 
said partial square distance upper limit value f^, and a norm 
division table and a declination division table in said vector 
index with respect to said partial query vector (b = 1 to m) ; 

index search means for searching a range of said vector 
index using (b, d, c, [r^, rj ) generated by said search object 
range generation means as a search condition, and obtaining the 
identification namber i and the component of the partial vector v^ 
satisfying the condition as an index search result; 

square distance difference upper limit calculation means 
for calculating a partial square distance difference fb-|vi,-qb|^ as a 
difference between said partial square distance upper limit value 
fb and a partial square distance Iv^-qbl^ of Vb and <i^, and 
accumulating (adding) the difference as a square distance 
difference upper limit value S[i] of the identification number i of 



a sqaare distance difference table; and 

similarity search result determination means for 
searching said vector index with the identification number i in 
order from a largest value in said square distance difference table 
S[i] to obtain a vector data cdrpon^t V, calculating a square 
distance difference value a^-|v-Q|^ by subtracting a square 
distance |v-qP of V and said query vector Q from a squared 
distance upper limit value a% and outputting a set of at least the 
identification number i and a distance (a^-t)^^ as a search result 
with respect to I, pieces at maximum of vector data with a large 
square distance difference value t when L or more pieces of vector 
data having the square distance difference value larger than a 
maximum value of an elonent having a non-calculated square distance 
difference value are collected, or when the square distance 
difference values of all the vector data having a positive square 
distance difference upper limit value are calculated in said square 
distance difference table. 

25. The similar vector searching apparatus according to 
claim 23 or 24 vAierein said partial query condition calculation 
means extracts N/m coanpon^ts or (N/m)+l coitponents in order frcan a 
top conponent of V so that all components of an N-dim^sional 
vector V are extracted, and prepares the partial query vector. 

26. The similar vector searching apparatus according to 
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claim 23 vtfierein the partial inner product lower limit value fj, as 
the lower limit value of the inner product of said partial query 
vector and the corresponding partial vector is calculated frcxn 
a designated inner product lower limit value a by = 
5 a|<i|V2(|q,r). 

27. llie similar vector searching apparatus according to 
claim 24 wherein the partial square distance upper limit value fj, 
as the upper limit value of the square distance of said partial 
10 query vector and the corresponding partial vector v^ is 

calculated frcm a designated distance lower/ipper limit value a by 
f, = a^|c|.r/5:(|q,r). 

28* A recording medium in vrfiich a coirputer program for 
15 executing the method of claim 1 or 2 is recorded. 

29. A recording medium in v^ch a con^uter program for 
realizing the apparatiis of claim 15 or 16 by software is recorded. 



